Defining User Profile to Improve Knowledge Extraction in a Digital Library of Scientific Documents

Authors

  • Rocío Abascal-Mena
  • Béatrice Rumpler
  • Suela Berisha-Bohé
Abstract

Annotation is a key way in which documents grow and increase in value. This paper explores the possibility of using concepts, extracted from documents with a Natural Language Processing tool, to characterize the content of digital theses. Building on the results of that study, the paper then explores how annotated theses can be used to access pertinent information stored in these documents and to extract knowledge by defining different user profiles.

Similar resources

Investigating the Level of Observing the Evaluation Criteria for User Interface in library services providing to the blind and deaf users in the word

Purpose: Digital library user interfaces have a determining role in the desirable performance of this kind of library. Digital library service providers to blind and deaf users will perform best when those users can interact properly with them. This study aims to evaluate and analyze the criteria related to user interface in digital libraries servi...

Scientific Data and Document Processing in ChemXSeer

ChemXSeer is a digital library and a data repository for the chemistry domain. The data deposited into our repository is linked with digital documents to create aggregates of resources representing the links between the data and the articles in which the data is reported. ChemXSeer enables the user to annotate the data using a metadata capturing tool. The metadata is indexed and searched to ret...

Iran National Digital Library of Medicine (INMDL): Dos and Don'ts

Iran National Digital Library of Medicine was launched in 2008 by Shahid Beheshti University of Medical Sciences in order to supply English language scientific resources for the Universities of Medical Sciences throughout the country. The Library could be accessed via www.inlm.org. Given the academic definition for national and digital libraries, it seems that the services and resources offered...

A New Domain Independent Keyphrase Extraction System

In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams from input document. We incorporate linguistic knowledge (i.e., part-of-speech tags), and statistical information (i.e., frequency, position, lifespan) of each n-gram in defining candidate phrases and their respectiv...

Unsupervised and domain-independent extraction of technical terms from scientific articles in digital libraries

A central issue for making the contents of documents in a digital library accessible to the user is the identification and extraction of technical terms. We propose a method to solve this task in an unsupervised, domain-independent way: We use a nominal group chunker to extract term candidates and select the technical terms from these candidates based on string frequencies retrieved using the M...

Publication date: 2006